Words containing , (comma):
Rank in Wordlist | Frequency | Word |
---|---|---|
33775 | 6 | 1,5 |
53059 | 3 | 0,6 |
53115 | 3 | 2,5 |
53139 | 3 | 24,9 |
53188 | 3 | 7,2 |
67245 | 2 | 0,3 |
67246 | 2 | 0,4 |
67247 | 2 | 0,4% |
67248 | 2 | 0,5 |
67256 | 2 | 1,3 |

Words containing %:
Rank in Wordlist | Frequency | Word |
---|---|---|
38171 | 5 | %. |
38172 | 5 | %، |
44220 | 4 | 85% |
53145 | 3 | 28% |
67247 | 2 | 0,4% |
67262 | 2 | 10% |
67367 | 2 | 20% |
67426 | 2 | 3% |
67462 | 2 | 4% |
67491 | 2 | 50% |

Words containing ' (apostrophe):
Rank in Wordlist | Frequency | Word |
---|---|---|
98566 | 1 | L'organigramme، |
98711 | 1 | c'est |
98728 | 1 | d'intérêts |
98729 | 1 | d'intérêt، |
98771 | 1 | l'évolution |
108815 | 1 | أفطرت''، |
120878 | 1 | السحر''، |
120887 | 1 | السحور''، |
138031 | 1 | بـ'المتينة |
163387 | 1 | قاعدون'؟ |

Words containing / (slash):
Rank in Wordlist | Frequency | Word |
---|---|---|
53122 | 3 | 2008/2009 |
67261 | 2 | 1/1 |
67374 | 2 | 2009/2010 |
67375 | 2 | 2009/2010، |
67378 | 2 | 2010/2011، |
67380 | 2 | 2011/2012، |
67517 | 2 | 6/ |
71188 | 2 | الاثنين/ |
72074 | 2 | الجديدة/القديمة |
96666 | 1 | 00/07 |
In the last subsection of this type we look for words containing other special characters: , ( ) % & $ " ' + * = / _
Depending on the language, some of these characters may be allowed within words while others are not. If words containing forbidden characters do not have a very low frequency, there might be a problem in preprocessing.
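The check described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions, not the procedure used to produce this report: the function name `suspicious_words`, the threshold `max_freq`, and the decision to treat every listed character as forbidden are hypothetical; the sample rows are copied from the tables in this section.

```python
# Special characters listed in this subsection. Which of them are really
# forbidden depends on the language (assumption here: all of them are).
SPECIAL_CHARS = set(",()%&$\"'+*=/_")


def suspicious_words(rows, forbidden=SPECIAL_CHARS, max_freq=1):
    """Return (word, freq) pairs that contain a forbidden character and
    occur more often than max_freq (a hypothetical threshold)."""
    return [
        (word, freq)
        for word, freq in rows
        if freq > max_freq and any(ch in forbidden for ch in word)
    ]


# Sample (word, frequency) rows copied from the tables in this section.
sample = [("1,5", 6), ("85%", 4), ("c'est", 1), ("2008/2009", 3)]
flagged = suspicious_words(sample)
```

With the default threshold, words that occur only once (such as `c'est` above) are not flagged, while the repeated numeric tokens are. Whether such tokens indicate a preprocessing problem still has to be judged per language.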
Words containing %:
select w_id-100,freq, word from words where w_id>100 and word like "%\%%" limit 10;
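The same query pattern can be repeated for each of the other special characters. The sketch below is a hedged illustration using Python's built-in `sqlite3` module and a toy in-memory `words` table; the helper `words_containing` and the inserted rows are hypothetical. The query in the text relies on the backslash being the default `LIKE` escape character (as is the case in MySQL, for example); SQLite has no default escape character, so the sketch adds an explicit `ESCAPE` clause.

```python
import sqlite3

# Toy stand-in for the corpus database. Table and column names follow the
# query in the text; the rows are illustrative only.
conn = sqlite3.connect(":memory:")
conn.execute(
    "CREATE TABLE words (w_id INTEGER PRIMARY KEY, word TEXT, freq INTEGER)"
)
conn.executemany(
    "INSERT INTO words (w_id, word, freq) VALUES (?, ?, ?)",
    [(101, "the", 900), (102, "85%", 4), (103, "1,5", 6),
     (104, "c'est", 1), (105, "2008/2009", 3), (106, "a_b", 1)],
)


def words_containing(conn, char, limit=10):
    """Return (rank, freq, word) rows for words containing `char`.

    `%` and `_` are LIKE wildcards, so they (and the escape character
    itself) are prefixed with a backslash before building the pattern.
    """
    escaped = "\\" + char if char in "%_\\" else char
    pattern = "%" + escaped + "%"
    sql = (
        "SELECT w_id - 100, freq, word FROM words "
        "WHERE w_id > 100 AND word LIKE ? ESCAPE '\\' "
        "ORDER BY w_id LIMIT ?"
    )
    return conn.execute(sql, (pattern, limit)).fetchall()


percent_rows = words_containing(conn, "%")
underscore_rows = words_containing(conn, "_")
comma_rows = words_containing(conn, ",")
```

Passing the pattern as a bound parameter avoids quoting problems for characters such as `'` and `"`, which would otherwise have to be escaped inside the SQL string literal.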